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We use the traveling front approach to derive exact asymptotic results for the statistics of the 
number of particles in a class of directed diffusion limited aggregation models on a Cayley tree. 
We point out that some aspects of these models are closely connected to two different problems 
in computer science, namely the digital search tree problem in data structures and the Lempel-Ziv 
algorithm for data compression. The statistics of the number of particles studied here is related 
to the statistics of height in digital search trees which, in turn, is related to the statistics of the 
length of the longest word formed by the Lempel-Ziv algorithm. Implications of our results to these 
computer science problems are pointed out. 
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I. INTRODUCTION 

The simple model of diffusion limited aggregation 
(DLA), ever since its introduction by Witten and Sander 
in 1981 [1], has continued to play a central role in under- 
standing the fractal growth phenomena. Besides raising a 
number of conceptual issues regarding pattern formation, 
this model has also found numerous applications in phys- 
ical processes ranging from dielectric breakdown [2] and 
Hele-Shaw fluid flow [3] to electrodeposition [4] and den- 
dritic growth [5]. In the simplest version of this model, 
one considers, for example, a square lattice where the 
origin is a seed. Particles are released sequentially from 
the boundary. Each particle performs a random walk in 
space and when it comes in contact with the growing clus- 
ter around the central seed, it sticks to the cluster and 
thus the cluster grows. This growing DLA cluster has a 
fractal structure with many branches that are separated 
by deep 'fjords'. Despite various advances, characterizing 
this fractal pattern quantitatively has remained a major 
theoretical challenge for the past two decades [6]. One 
clear picture that has emerged out of various studies is 
that the key effect responsible for this complex pattern is 
the dynamical 'screening' : a newly arriving particle has 
more probability to attach to the 'tip' sites compared to 
other boundary sites that are deep inside a 'fjord'. As 
a result, the faster growing parts of the cluster bound- 
ary shield or screen other boundary sites from further 
growth. 

To understand this dynamical screening effect more 
quantitatively, it is desirable to construct a simpler model 
which incorpoartes the screening effect and yet is analyti- 
cally tractable. Bradley and Strenski [7] introduced such 
a model where particles undergo a directed diffusion lim- 
ited aggregation (DDLA) on a Cayley tree. Physically 
this corresponds to the situation when there is a strong 
external field such as the gravity or an electric field which 
forces the particles to choose an overall direction of mo- 



tion. In this DDLA model, one starts with a Cayley tree 
of height I (see Fig. 1) where all the 2 l — 1 sites of the 
lattice are initially empty. Then a particle is introduced 
at the top site and it performs a directed (downwards) 
random walk (from any site it choooses one of the two 
daughter sites at random and moves there) till it reaches 
one of the bottom leaves and can descend no more. It 
then occupies that leaf site. 




FIG. 1. A typical history of the DDLA process till satura- 
tion on a Cayley tree with height I = 4. The occupied nodes 
contain a black filled circle inside them and the number next 
to an occupied site denotes the particle number. 

Then a second particle is introduced from the top site 
and it also performs a directed random walk. This sec- 
ond particle will stop if it happens to reach a site such 
that atleast one of the daughter nodes of that site is al- 
ready occupied. It can not descend any more and it rests 
at that site for all subsequent times. Note that in this 
model, each site can contain at most one particle. Then 
a third particle is released from the top and so on. Basi- 
cally, after reaching any site, say the i-th site, a particle 
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attempts to hop down to one of the two daughter nodes 
of i and it actually moves to the target site provided both 
the daughter nodes of i are empty. If atleast one of them 
is occupied, the particle can not descend any further and 
it rests at site i forever. Then the next particle is added 
and the process continues until no more particles can be 
put in, i.e. when the top site gets occupied. The tree 
is then said to be saturated. One such history of the 
process till its saturation is shown in Fig. 1. A typical 
snapshot of the saturated tree (see Fig. 1) shows that 
the cluster has voids of various sizes. A useful quantity 
to characterize the pattern of the cluster is the total num- 
ber of particles ni in the saturated tree. Clearly n; is a 
random variable, varying from one history of the process 
to another. The quantities of central importance in this 
problem are the average density, pi = {ni)/[2 l — 1] and the 
fluctuations of m around its average value. How do these 
two quantities behave asymptotically for a large tree, i.e. 
when I — > oo ? 

While it was easy to write down the basic recursion 
relations (see later in Section II) for certain probabili- 
ties associated with the DDLA process on a tree, they 
turned out to be nonlinear [7] and hence it was difficult 
to determine even the asymptotic behavior of pi for large 
I. Bradley and Strenski studied the recursion relations 
numerically and found, somewhat unexpectedly, that pi 
decays slower than exponentially with I for large I but the 
precise nature of this decay was not evident from their 
numerics [7] . Later Aldous and Shields [8] studied via rig- 
orous probabilistic methods a completely different model 
namely a continuous time version of the so called digital 
search tree (DST) problem relevant in computer science 
[9,10,8,11-14]. As we will see later in Section- VI, these 
two models namely the DDLA and the DST share the 
same recursion relation, though for very different quan- 
tities. The rigorous results of Aldous and Shields [8], 
when translated back in the DDLA language, would in- 
dicate a streched exponential decay for the average den- 
sity, pi ~ 2 _v ^ for large I. Recently a more refined 
result on DST was derived by Knessl and Szpankowsky 
[15]. Unaware of the DDLA or the DST model, Hastings 
and Halsey also studied independently a related model 
recently [16] and used extremal arguments to conclude 
the same streched exponential decay for the average den- 
sity. 

The methods used by the mathematicians, though rig- 
orous, lack physical transparency. On the other hand, 
the extremal arguments used by Hastings and Halsey, 
though physically intuituive, are heuristic. Moreover, it 
is not easy to derive quantitative results regarding fluc- 
tuations of the number of particles ni via these methods. 

For example, how does the width w{l) = \J (nf) — (ni) 2 
behave as a function of 17 Besides, none of these methods 
are easily adaptable to extract the leading asymptotic be- 
haviors in more general models such as one that we will 



consider in this paper. Our approach in this paper would 
be to use the powerful techniques (suitably adapted for 
our problem) of traveling fronts, originally developed in 
the context of nonlinear reaction-diffusion systems and 
population dynamics [17-19]. The techniques of travel- 
ing fronts have found a host of very useful applications 
[20] . Recently we have pointed out that in many extreme 
value problems in both physics and computer science, one 
can successfully use the traveling front techniques to de- 
rive exact asymptotic results for the statistics of the ex- 
ternum of a set of correlated random variables [21-24]. 
The present paper points out yet another useful applica- 
tion of the traveling front techniques, namely in a gener- 
alized DDLA problem with relevance to a class of search 
tree problems in computer science. 

The traveling front method, though technically not rig- 
orous in the strict mathematical sense, has the following 
advantages over the other methods used in the DDLA 
problem: (i) this method is not model specific, is more 
general and is easily adapdable to more general models 
such as the ones that will be studied in this paper, (ii) 
it is easy to implement and is physically transparent and 
(iii) it provides a very cheap way to extract the lead- 
ing asympotic behavior exactly without using too much 
mathematics. Besides rederiving the known results in 
the standard DDLA problem, this method also allows 
us to derive many new results in more generalized mod- 
els. For example, we show that in the DDLA model, 
the random variable ni approaches to its average value, 
n —* (ni) in the I — > oo limit. This is the example of 
the extreme concentration of measure, i.e., the distribu- 
tion of ni tends to a delta function. In particular, we 
show that the width of the distribution decays slowly as 
a power law, w(l) ~ Z -1 / 2 as I — > oo. Furthermore, we 
point out the close connection between the DDLA prob- 
lem and the DST problem widely studied by computer 
scientists [9,10,8]. The later problem is also related to the 
well known Lcmpel-Ziv algorithm used in data compres- 
sion [25]. Some of the results derived in this paper will 
constitute new results in these computer science prob- 
lems. 

The rest of the paper is organized as follows. In the 
next section we introduce a generalized b-DDLA model 
(where & is a positive integer). The original DDLA model 
of Bradley and Strenski is a special case of this b-DDLA 
model with 6=1. We then derive the asymptotic statis- 
tics of the number of particles in the b-DDLA model 
using the traveling front technique, suitably adapted for 
this model. In Section III, we generalize these results 
to the case when the Cayley tree has m > 2 branches. 
Section IV considers the DDLA model with a bias when 
a particle has more probability to go to the left branch 
compared to the right one. In Section V, we point out 
the detailed connections between the DDLA model, the 
DST problem in computer science, and the Lempel-Ziv 
parsing algorithm and discuss the implications of our rc- 
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suits for the generalized b-DDLA model in the context 
of computer science. Finally a brief summary and a con- 
clusion along with a list of open problems are offered in 
Section VI. 



II. THE B-DDLA MODEL AND ITS TRAVELING 
FRONT SOLUTION 

Here we introduce a generalized b-DDLA model where 
the 'hard' screening of the usual DDLA model is 'soft- 
ened' in the following sense. As in the usual DDLA 
model, one starts with an empty Caylcy tree of height 

I and the particles are introduced sequentially at the top 
site and they go down the tree one at a time by perform- 
ing a random walk. However, now each site can contain 
at most 6 particles where 6 is a positive integer. During 
its journey downward, when a new particle arrives at an 
empty site, say the i-th site, it tries to move to one of 
the daughter nodes of i chosen at random. It actually 
moves to the target site provided both the daughter sites 
contain less than b particles. If cither of them contains b 
particles, i.e. is completely full, then the incoming parti- 
cle can not move down any further and it then stays at 
site i forever. Thus, in this model, a site can incorporate 
'screening' if and only if it has full capacity, i.e. when it 
has b particles. Otherwise it fails to screen. This model 
thus mimics the physical situation when one single par- 
ticle is incapable of stopping an incoming particle to go 
down, but the screening comes into play only as a collec- 
tive effect when there are b particles in the site. This is 
like a tunnelling effect, where a particle can go through 
a barrier provided the barrier is not too high. However, 
the rate of tunnelling goes to zero when the barrier height 
crosses a threshold. In our model, the parameter b acts 
like the threshold value. Clearly, for b = 1 this model 
reduces to the original DDLA model studied by Bradley 
and Strenski [7]. 

As in the 6=1 case, the tree is going to be satu- 
rated after a finite number of particles have been added 
to it. This happens when the top site contains b par- 
ticles. No further particles can then be put in. The 
number of particles ni required to saturate the tree of 
height I is clearly a random variable, fluctuating from 
one history of the process to another. The main ques- 
tion we address is: what is the statistics of ni for large 

II In particular, we would compute the average density 
at saturation, pi = (n;)/[2 J — 1] and the width of the 

distribution, w(l) = \J (nf) — (ni) 2 for large I. 

Following Bradley and Strenski [7] for the 6=1 case, 
we define Gi(n) to be the probability that the tree of 
height I is not saturated after the addition of n particles, 
i.e. the top site is not yet filled by 6 particles after n 
particles have been added to the tree. It is easy to see 
that Gi{n) satisfies the following recursion relation, 



G l+1 (n + b) = ^11 (2)G l {n 1 )G l {n-n 1 ), (1) 

for all Z > 1 and n > 0. It is useful to think of I as 'space' 
and n as 'time'. The Eq. (1) is supplemented with the 
'boundary' condition, G\{n) = 1 for all < n < (6 — 1) 
and G\{n) = for all n > b and the 'initial' condition, 
Gi(n) = 1 for < n < (6— 1) for all I > 1. The recursion 
relation in Eq. (1) is easy to understand. Suppose we 
have added (n + 6) particles to a tree of height (I + 1) 
(the left hand side of Eq. (1)). The condition that the 
top site is not yet filled by 6 particles indicates that be- 
fore the addition of the last 6 particles, the two daughter 
nodes of the top site must have both remained unsatu- 
rated. This is becuase, if either one or both of them had 
been saturated after the addition of n particles, then any 
further added particle would not be able to go down and 
would rest at the top site, and hence the top site would 
then get saturated after the addition of (n + 6) particles. 
The two daughter nodes are the roots of two uncorre- 
cted subtrees, each of height I. Hence the probability 
that both remain unsaturated is given by their product. 
Also, the number of particles n\ that enter, for example, 
to the left subtree (out of a total number of n particles 
that enter both subtrees) must have a binomial distribu- 
tion, thus explaining the right hand side of Eq. (1). 

Note that for fixed n > 0, the probability Gi(n) — > 
as I — > 1 and Gi(n) — > 1 as I — > oo. For later analysis, it 
turns out to be convenient to define the complementary 
probability, Fi (n) = 1 — G; (n) , that has the opposite be- 
havior as a function of I, namely F/(n) — > 1 as I — > 1 and 
Fi(n) — > as I — > oo for any fixed n > 0. The quan- 
tity Fi (n) denotes the probability that the tree of height 
I gets saturated before n particles are added. From Eq. 
(1), one finds that Fi(n) satisfies the recursion, 

1 n ( \ 

-^EQ^^-nO, (2) 
m=0 ^ ' 

with the boundary condition, F\(n) — for < n < 
(6 — 1) and Fi(n) = 1 for n > b and the initial condition, 
Fi(n) = for < n < (6 - 1) for all I > 1. It is useful to 
think by fixing the 'time' n while varying the 'space' /. 
Clearly Fi(n) — > as I — > oo, since almost surely a tree of 
infinite height will not be saturated before the addition a 
fixed, finite number n of particles. On the other side, for 
fixed n, Fi{n) — > 1 as I — > 0. For a given fixed n, as one 
increases I from to oo, the function F ; (n) starts off at 
the value 1 at / = and then drops off to beyond some 
characteristic length scale l*(n), as shown schematically 
in Fig. (2). As n increases, this characteristic length 
scale l*(n) also increases (see Fig. (2)), thus giving rise 
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to a traveling front structure with the front located at 
l*(n). In fact, we will see later that in the limit of large I 
(when one can treat I as a contiuous variable) and large 
n, the width w(n) of the front goes to zero, indicating 
that asymptotically the function Fi(n) becomes a Heav- 
iside theta function, Fi(n) — > 9(1* (n) — I). 



a, 
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FIG. 2. Schematic behavior of the probability Fi(n) as a 
function of I for different fixed values of n. The three curves 
correspond to different values of n increasing from left to right. 
The y axis is dimensionless while the x axis has arbitrary 
units. 

Note that Eq. (2) is nonlinear and hence is dif- 
fcult to solve exactly. However, the exact asymptotic 
informations regarding the position l*(n) of the front 
and its width can be deduced by adapting the travel- 
ing front techniques that were originally devised to deal 
with nonlinear partial differential differential equations in 
reaction-difffusion systems [17] and pupulation dynamics 
[18,19]. The basic idea behind this approach is very sim- 
ple. If there is a front l*(n), then ahead of the front 
I > l*(n), Fi(n) is very small and hence one can neglect 
the nonlinear term (the second term) on the right hand 
side of Eq. (2) and one simply gets a linear recursion, 



1 ™ 



711=0 



Fi(m). 



(3) 



Suppose one could solve this linear equation exactly sat- 
isfying the required initial condition. Now one expects 
that the solution of the linear equation (3) and the 'real' 
solution of the nonlinear equation (2) will coincide in 
the regime beyond the front, i,e. for I > l*(n). On the 
other hand, the two solutions will start differing from 
each other as one decreases I below l*(n), where the so- 
lution of the nonlinear equation (2) will tend to 1 where 
as the solution of the linear equation (3) will grow be- 
yond 1 with decreasing / (as there is no nonlinear term 
to control the solution). Thus, as one decreases I from in- 
finity, the front position l*(n) is approximately the value 
of I at which the solution of the linear equation becomes 



~ 0(1). Thus, according to this approach, one first 
solves the linear equation (3) to obtain i 7 i(n)|ii noar and 
then reads off the front position l*(n) from the condi- 
tion, i^»(„)(n) | linear ~ 0(1). By 0(1), one means that 
at I = l*(n), the solution Fi(n) should not diverge or 
decay exponentially with increasing n. Note that this is 
a slightly generalized version of the usual traveling front 
method [20,21] where one usually has a linear operator 
with constant coefficients which admits an exponentially 
decaying solution of the form exp[— X(x — vt)] with con- 
stant width. The present approach is more general and 
works even when the linear operator does not admit an 
exponentially decaying solution with a constant width. 

Under this traveling front approach , our task thus re- 
duces to solving the linear equation (3) which, however, 
is still nontrivial. To proceed, we define a somewhat un- 
usual generating function, 



oc 1 

71 = V ' 



(4) 



Using Eq. (3) one can then show that Fi(s) satisfies a 
rather simple recursion in I, 



Fi+i(s) 



(i + s y 



;Fi(2s), 



(5) 



for all I > 1. The steps leading to Eq. (5), starting from 
Eqs. (3) and (4), arc not completely straightforward. 
Hence we present this derivation in the Appendix. The 
recursion in Eq. (5) starts with the initial value F\ (s) 
which needs to be calculated separately. Noting that 
Fi(n) = for < n < (b- 1) and iq(n) = 1 for n > b, we 
find from the definition in Eq. (4), F\(s) = l/[s(l + s) b ]. 
Iterating Eq. (5) and using the expression for I = 1, we 
get 



tl-i 



Fi(s) = 



s[(l + s)(l + 2s)...(l + 2'" 1 s 



■ exp 



l-i 



-6^1n(l + 2 fe .s) 



k=0 



(6) 



for all I > 1. 

We then write the sum inside the exponential in Eq. 
(6) in two parts using the Euler-Maclaurin summation 
formula, S(l, s) = £L=o In (1 + 2 k s) = 1(1, s) + R(l, s) 
where 1(1, s) = f Q In (1 + 2 x s)dx and R(l, s) = S(l, s) - 
1(1, s) is the residual term. The integral 1(1, s) can be 
done and one gets, 1(1, s) — [Y(2 l s) — Y(s)]/ln2 where 
the function Y(z) is given by 



Y(z) = 



E^-i)"- 1 *"- 1 "- 

iln 2 z-ln(l + l/z)- 



ln 2 + 71-712 



for z < 1, 
for z > 1. 

(7) 
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The structure of these expressions suggests a natural scal- 
ing limit, s — > 0, I — > oo but keeping the product z = s2 l 
fixed but arbitrary. We also treat I as a continuous vari- 
able in this limit. Furthermore, we focus only near the 
tail of the scaling regime, i.e. when z » 1. In this 
regime, it is sufficient to keep only the first term in the 
second line of Eq. (7) for the expression of Y(z). Besides, 
one can also check that the residual term is sublcading 
in this regime. Keeping only the leading terms we get 



F t (s) w -exp 
s 



In 2 



^ + log 2 (*)) 2 



(8) 



We still need to invert the generating function in Eq. (4) 
to obtain the asymptotic behavior of Fi(n). The scaling 
limit corresponds to taking n — > oo, I — > oo but keeping 
the ratio 2 l /n fixed but arbitrary. Using Eq. (8) and 
inverting Eq. (4) (using the Bromwich inversion formula 
and then using the standard steepest descent method), 
we find the following leading asymptotic behavior, 



Fi (n) w exp 



In 2 



/-^(/-log 2 (n)f 



(9) 



valid in the tail I >> log 2 (n). 

Having obtained the asymptotic solution (9) of the lin- 
ear equation (3), the location of the front l*(n) can be 
read off from the equation, Fp (n)\\i neaT ~ 0(1). Using 
this criterion in Eq. (9), we find that the front position 
l*(n) is given by the quadratic equation, 



r--[r-i og2 (n)] 2 = o. 



(10) 



As we decrease I from oo, we will encounter the larger 
root first, which will correctly locate the front position. 
From Eq. (10), we get, for large n, the asymptotic front 
position 



l*(n) w log 2 (n) + \j -log 2 (n). 



(11) 



Furthermore, substituing I 
and expanding for small y, 

Fi(n) ~ exp — v /261n(2) ln(n) (I 
that the width of the distribution, characterizing the flue 
tuation of I around its average value l*(n), decreases ex- 
tremely slowly with n as, 



= l*(n) + y in Eq. (9) 
we find to leading order, 
l*(n)) , indicating 



w(n) w n , 



(12) 



as n — > oo. The fact that the width vanishes in the n — > 
oo limit shows that the probability Fi(n) — > 9(1* (n) — I), 
thus indicating an extreme concentration of measure, i.e. 
the random variable I — > l*(n). 

In the above analysis, we kept n fixed and studied the 
behavior of Fi(n) as a function of I. Alternately, as is 
more suited for the DDLA problem, one can keep I fixed 



and vary n. It follows from Eq. (9) that in the scaling 
limit mentioned above, the random variable log 2 (n) ap- 
proaches to its mean value (log 2 (n)) = I — \/2l/b. Due 
to the extreme concentration of measure, it follows then 

that n -» (ni) w 2 l ~V^ in the scaling limit. This 
means that the average density pi = (n)/(2 l — 1) decays 
as a stretched exponential for large I, 



P(l) 



(13) 



Besides, substituting log 2 (n) = (log 2 (n)) — y\ in Eq. 
(9) and expanding for small y\, we find, Fi(n) w 



exp — ln(2)V2W yi ■ This indicates that as a function 
of I, the width of the random variable log 2 (n) around 



its average value (log 2 (n)) 
braically for large I 

w(l) 



^2b\n 2 (2)l 



I — \j2ljb decreases alge- 



(14) 



Note that the leading order behaviors of the widths in 
Eqs. (12) and (14) are compatible with each other with 
the identification n w 2 l . The Eqs. (11), (12), (13) and 
(14) constitute the main results of this section. 



III. GENERALIZATION TO A TREE WITH M 
BRANCHES 

In this section we generalize our results for the b- 
DDLA in the previous section (obtained for a tree with 
m = 2 branches) to a tree with m > 2 branches. In 
this case, during its downward journey from the top, a 
particle from a given site attempts to hop to any of the 
m daughter nodes with equal probability 1/m and can 
actually hop to the target site provided none of the m 
daughter nodes is full with b particles. If it fails to hop, 
the particle stays at its current site for all subsequent 
time. The probability Gi(n) that a tree of height I is yet 
to be saturated after the addition of n particles satisfies 
the generalized recursion relation, 



m m 



Gt( ni ) 



m—0 i—i 



mi 



(15) 



where the variables rij's satisfy the constraint 1 rij = 
to. Thus the binomial coefficient in Eq. (1) of the previ- 
ous section gets replaced by a multinominal. The rest of 
the analysis is straightforward and similar to the previous 
section. We define as usual, the complementary proba- 
bility, Fi(n) = 1 — Gi(n), which satisfies the recursion, 



n< 



+ nonlinear terms, 



Fi( ni ) 



(16) 
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where we have used the symmetry that all branches are 
similar to each other. 

As before, we solve the equation (16) retaining only 
the linear terms and neglecting the nonlinear terms. We 
define the generating function as in Eq. (4). Following 
the derivation presented in the Appendix and using the 
initial condition, we get the solution 



Fi(s) = 



rn 



l-i 



s[(l + s)(l + ma) . . . (1 + m l - 1 s)} 



b ' 



(17) 



The asymptotic analysis is exactly similar to the previ- 
ous section, except that the proper scaling limit now is 
s — > 0, I — > oo but keeping the product sm l fixed but 
arbitrary. We do not repeat the steps here but present 
only the final results. We find that as in the m = 2 case, 
there is a front whose asymptotic location l*(n) is given 
by 



l*(n) «log m (n) 



(18) 



and in the limit n 
vanishes slowly as 



oo, the width w(n) of the front 



w(n) w 1/^/26 ln(m)ln(n). 



(19) 



Similarly, we find that for fixed but large I, the average 
density varies as a strectched exponential, 



(20) 

and the width w(l) of the random variable log m (n) 
around its average value (log m (ra)) = Z — \/2l/b decreases 
algebraically for large I, 



w(l) 



^2b\n(m)l 



(21) 



Note that, interestingly, the asymptotic average value 
(log TO (n)) = I — \j2ljb is actually independent of m. 



IV. B-DDLA MODEL WITH BIASED HOPPING 

In this section we consider the b-DDLA model on a 
m = 2 tree where the particles perform biased random 
walk on their way down the tree. More precisely, when a 
particle arrives at any given site i on its way down after 
being introduced at the top site, it attempts to hop to 
the left daughter of the node i with probability p and to 
the right daughter with probability q = 1 — p. As be- 
fore, it actually moves to the target site provided both 
the daughter nodes have less than b particles. If at least 
one of them is full with b particles, then the particle rests 
at site % for all subsequent times. Then a new particle 
is added and the process continues till the top site gets 



filled with b particles. Once again, we are interested in 
the statistics of the number of particles n; when the tree 
of height I gets saturated. We define, as before, Gi(n) 
to be the probability that the tree of height I remains 
unsaturated, i.e. the top site remains unfilled upto the 
addition of n particles. Following the same logic as in 
Section II, one easily finds the recursion relation, 



G l+1 (n + b)= J2 L )p ni g n - ni Gj(»i)Gi(n-ni), 



(22) 



for all Z > 1 and n > 0. 

The complementary probability, Fi(n) = 1 — Gi(n), 
then satisfies the recursion, 



F l+1 (n + b)= ]T f Jf/V- ni [Fi{ni) +Fi(n- m)] 



+ nonlinear terms, 



(23) 



with the boundary condition F\(n) = for < n < 
(b — 1) and F\(n) = 1 for n > b. As before, we solve 
the equation (23) keeping only the linear terms and ne- 
glecting the nonlinear terms. This is done via defining 
the generating function Fi(s) as in Eq. (4). Following 
the same line of derivation presented in the Appendix, 
we obtain the following recursion relation, 



Fi+i(s) 



p(l + s) b 



Fi(s/p) 



1 



5(1 + s) 



;Fi(s/q), (24) 



which starts from the initial function, Fi(s) = l/[s(l + 
s)]. One can, in principle, iterate Eq. (24) starting with 
Z = 1 and obtain the expressions for Fi(s) for all Z. For- 
tunately, in the scaling regime s — ► 0, one does not need 
the full expression for Fi(s). Note that in the unbiased 
case p = q = 1/2, the appropriate scaling regime was 
s — > 0, Z — > oo but keeping the product z = s2 l fixed but 
arbitrary. In the biased case, it is clear from Eq. (24) 
that the appropriate scaling regime will be set by taking 
s — > 0, Z — > oo but keeping the product z = scr 1 fixed 
where o = min(l/p, 1/q). Thus in this scaling limit, one 
can approximate Eq. (24) by 



Fi+i(s) 



+ 



-Mas). 



(25) 



The terms neglected in going to Eq. (25) from Eq. (24) 
only contribute to subleading order. Iterating the re- 
duced Eq. (25) starting with Fi(s) = l/[s(l + s)], one 
obtains 



Fi(s) 



s [(1 + s)(l + as)(l + <t 2 s) . . . (1 + a l - 



(26) 
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We then invert the transform in Eq. (26) using the 
same asymptotic method as in Section II, the details of 
which we do not repeat. The final asymptotic form of 
the distribution Fi(n) is given by, 



Ft (n) s=s exp 



ln(a) 



{l-\og a {n)Y 



(27) 



valid in the scaling regime, n — ► oo, I — > oo but with the 
ratio y = 2'/n fixed at a large value y » 1. The front 
position can be read off from the condition Ft* (n)|ii nea r ~ 
0(1) which gives, to leading order, l*(n) w log CT (n). Note 
the difference with the unbiased case. Unlike the unbi- 
ased case in Section II where the width vanishes for large 
n, here the width of the distribution remains of 0(1) in 
the large n — ► oo limit, w(l) — ► 1/Vhi a. This result 
also indicates that the average density of particles varies 
as (nt) fts g 1 for large /. Thus, unlike the unbiased case 
where the average density decays as a stretched expo- 
nential for large I, the average density in the biased case 
decays exponentially for large I, 



p{l) « (a/2) 1 , 



(28) 



where a = rnin(l/p, 1/q). Besides, it follows from Eq. 
(27) that the fluctuations of the variable log CT (n) around 
its average value I are characterized by a Gaussian tail 
with width of O(l). 



V. CONNECTION TO DIGITAL SEARCH TREES 
AND THE LEMPEL-ZIV PARSING ALGORTHIM 

In this section we point out the connection between our 
generalized b-DDLA model to the so called digital search 
tree problem in computer science [9,10,8,11-13,26] which, 
in turn, is also related [14] to the Lempel-Ziv data com- 
pression algorithm [25]. Suppose we have a data string 
{xi,X2, ■ ■ ■ ,x n } which needs to be stored on a binary 
tree. According to the digital search tree (DST) algo- 
rithm, one proceeds as follows. Initially all the nodes of 
the tree are empty. The first arriving element x\ is put 
at the root of the tree. Each node can contain at most 
one element. The second element xi is put at one of the 
daughter nodes of the root chosen at random. Then for 
the next element X3, one again starts at the root and 
choses one of the daughter nodes at random. If the cho- 
sen node is empty, x% goes there. But if the chosen node, 
say i, happens to be the one that contains 22, then one 
chooses one of the two daughter nodes of i at random 
and puts X3 there. Then one stores the fourth element 
X4 following the same algorithm and so on. Essentially 
each element Xi performs a directed random walk down 
the tree till it finds an empty site which it then occu- 
pies. The process stops when all the n elements have 
been stored and the resulting tree is called a DST (see 
Fig. 3). Note that according to this DST algorithm, the 



actual value of a data element say Xi is not important. 
This is contrast to other search trees, such as the random 
binary search trees [9,10] where the actual value of Xi is 
used in constructing the tree. 




FIG. 3. A typical digital search tree constructed from a 
data string {xi, X2, £3, £4, £5} of 5 elements. 

The statistics of various quantities such as the distri- 
bution of the number of occupied nodes at a given depth 
(known as the profile of the DST) have been studied in 
great detail in the computer science literature [14]. Fla- 
jolet and Richmond [12] introduced a generalized version 
of the DST where each node can contain at most b ele- 
ments. In this b-DST, an incoming element x% performs, 
as in the b = 1 directed random walk downwards. 

However, when Xi reaches a new site, say k, it will stay 
forever at k provided the number of already existing ele- 
ments at k is less than b. If the site k already contains b 
elements, then one chooses one of the daughter nodes of 
k and the element Xi hops there. This generalized b-DST 
problem has many applications in computer science, no- 
tably in the maintenance of paged hashing tables [12]. 
Flajolet and Richmond studied, for example, the average 
number of non-empty nodes in a b-DST as a function of 
the data size n and the parameter b. 

One important characteristic of a b-DST is its height. 
The height I of a tree with n elements is defined as the 
depth, counted from the root, of the farthest element in 
the tree. Clearly I is a random variable, fluctuating from 
one realization of the tree to another and also it is an 
extreme variable (denoting the maximum depth) . A nat- 
ural question is: what is the probability distribution of 
the height? Let us define Qi (n) to be the probability that 
the height of tree with n elements is < I. It is easy to see 
that Qi (n) satisfies the following recursion relation 

Q l+1 (n + b) = ^J2 (^Wni)Qi(n-ni), (29) 

ni=0 ^ ^' 
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for all Z > 1 and n > 0, with the additional condition 
that Qi(n) = 1 for all < n < b and Qi(n) = 0. For the 
case b = 1, this recursion relation was recently studied 
by Knessl and Szpankowsky [15] using rigorous methods. 
The recurrence in Eq. (29) is a generalized version of the 
6=1 case and can be understood as follows. Consider a 
tree with a total number of (n + b) elements. The first b 
elements will be stored in the root and the rest of the n 
elements will be distributed to the left and right daughter 
subtrees. The probability that one of the subtrees, say 
the left one, gets n\ elements out of a total n elements 
is simply given by the binomial distribution. Also, since 
the condition that the height of the full tree is < I + 1 
(the left hand side of Eq. (29)) indicates that the height 
of both of the daughter subtrees must be < I. Since the 
two daughter subtrees are completely independent, this 
probability is given by their product. 

Note that the recursion relation in Eq. (29) for the 
height distribution Qi (n) in the b-DST is identical to the 
recursion in Eq. (1) in Section II for the probabilities 
Gi(n) in the b-DDLA problem, except for the slight dif- 
ference in the initial values Qi(n) and Gi(n). This slight 
difference does not affect the asymptotic behaviors. So, 
one can apply all the results obtained via the traveling 
front approach in Section II for the b-DDLA model di- 
rectly to the b-DST problem. In particular, the result in 
Eq. (11) indicates that the average height of the b-DST 
has the asymptotic following behavior for large n, 



r(n)wlog 2 (n) + y-Iog 2 (n). (30) 

For b = 1, this result coincides with that of Aldous and 
Shields [8] obtained by probabilistic methods. Note that 
for b = 1 case, a more refined result including additional 
subleading terms to Eq. (30) was recently obtained in 
Rcf. [15] using rigorous methods. However, for general 
b, we are not aware of any rigorous results in the com- 
puter science literature and our Eq. (30) seems to be the 
first result for the average height of a b-DST. Further- 
more, Eq. (12) in Section II predicts that the standard 
deviation of the height around its average value decays 
extremely slowly with large n, w(n) w l/^/2bln(2) ln(ra). 
This result on the variance of the height in b-DST also 
seems not to have been obtained by other methods be- 
fore. 

We now turn to the Lempel-Ziv algorithm for data 
compression [25]. The connection between this algo- 
rithm and the DST problem was known before [8,14]. 
The Lempel-Ziv algorithm is central to many universal 
data compression schemes and have many applications 
such as in the efficient transfer of data [14]. This ba- 
sic scheme of this algorithm is very simple: it takes a 
given data string, say a sequence of binary digits such 
as 11000110111011110, and partitions it into 'words'. 
'Words' are subsequences of variable sizes which are never 



repeated and are constructed by employing the rule that 
a new 'word' is the shortest subsequence not seen in the 
past as a 'word'. This is best understood by an example. 




FIG. 4. The figure shows how 

the partitioning of a sequence 11000110111011110 into 'words' 
( 1) (10) (0) (01) (101) (1 1) (01 1) (1 10) using the Lempel-Ziv pars- 
ing algorithm can be represented as a digital search tree. The 
length of a 'word' is equal to its depth in the tree measured 
from the empty root at the top. 

Consider the binary sequence 11000110111011110 and 
construct 'words' starting from the left end using the 
Lempel-Ziv algorithm. Starting from the left end, the 
first digit encountered is 1. Since 1 has not occurred be- 
fore as a 'word', one can form the first 'word' (1). Now 
we move to the next element which also happens to be 1. 
But, now since (1) is already a 'word', the shortest seg- 
ment we can use to form a 'word' is (10). Similarly, the 
next word would be (0) since (0) has not occurrred before 
as a 'word'. One keeps repeating the procedure and at the 
end, the original sequence is partitioned into the following 
sequence of 'words': (1)(10)(0)(01)(101)(11)(011)(110). 
The original data is thus compressed into these words. 
Even though the 'words' are relatively short in the be- 
gining, it turns out that they become bigger quite rapidly. 
One of the interesting questions of practical importance 
in this scheme is the statistics of the longest 'word' when 
the original data string is random. For concreteness, let 
us consider a random binary sequence of initial data and 
focus on the first n 'words'. Let I be the length of the 
longest 'word' amongst these n words. Clearly I is a ran- 
dom variable since the underlying binary sequence is ran- 
dom. We are interested in the statistics of I as a function 
of n. 

There is a natural representation of this parsing algo- 
rithm in terms of a DST [14]. Consider a binary tree 
whose nodes are initially empty. In fact, the root of this 
tree is always going to be empty (see Fig. 4). Now we 
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take the first of the Lempel-Ziv parsed 'words' and ex- 
amine its first digit. If the first digit of this 'word' is 
1, we put this 'word' in the right daughter node of the 
root. On the other hand, if the first digit is 0, we put 
this word at the left daughter node. This newly occu- 
pied node is now full and can not accomodate any other 
'word'. Then we consider the second 'word' and look at 
its first digit. If the first digit is 1 (0), we go to the right 
(left) daughter node. Let us call this node i. If this node 
i is empty, we put the 'word' there. If i is already oc- 
cupied by the first 'word', then we need to examine the 
second digit of our second 'word' and depending on its 
value (1 or 0), we go respectively to the right or to to 
the left daughter node of i and put our second 'word' at 
this new site. This process is repeated until all the words 
are stored and the resulting tree is clearly a DST, since 
at each step the decision to go to the left or to the right 
occurs randomly (due to the randomness of the underly- 
ing binary sequence where each digit can be either or 
1 with equal probability). The construction of this DST 
from the parsed words (1)(10)(0)(01)(101)(11)(011)(110) 
is shown in Fig. 4. 

It is clear from the algorithm that the depth of a given 
'word' in the DST (measured from the empty root) is pre- 
cisely the equal to the length of the 'word' (see Fig. 4). 
In particular, the longest 'word' will also be the farthest 
from the root. Thus the length I of the longest word is 
precisely the height of the corresponding DST. There is a 
generalized Lempel-Ziv algorithm where during the par- 
titioning into 'words', any particular 'word' is allowed to 
be repeated at most b times [14]. Then the corresponding 
DST is precisely a b-DST. Thus our results regarding the 
average height l*(n) and its width apply as well to the 
longest 'word' in the generalized Lempel-Ziv algorithm. 

VI. CONCLUSIONS 

In this paper we have used a suitably adapted version 
of the traveling front approach to derive exact asymptotic 
results for the statistics of the number of particles in a 
generalized directed diffusion limited aggregation prob- 
lem. We have pointed out a close connection of this 
problem to two separate problems in computer science, 
namely the digital search tree problem and the Lempel- 
Ziv algorithm used for data compression. Our results 
for the number of particles in the generalized b-DDLA 
model have direct relevance to the statistics of height in 
the digital search tree problem and to the statistics of 
the longest word in the Lempel-Ziv algorithm. 

The traveling front approach has recently been used 
successfully [22,23] to derive exact asymptotic results for 
heights in a number of growing search tree problems in 
computer science. This paper shows that the scope of this 
approach can be extended to include yet another differ- 
ent class of search trees, namely the digital search tree. 



The main advantage of this method is that it provides 
an easy way to derive the leading asymptotic behavior 
exactly in a variety of extreme value problems [21]. 

The present study leads to a number of interesting, 
open problems which we list below. 

Undirected DLA problem on a tree: In this paper, we 
have focused on a directed model for simplicity, where the 
particles undergo diffusion but only in the overall down- 
ward direction. It would be interesting to extend the 
traveling front approach to an undirected model such as 
the one studied by Hastings and Halsey [16]. 

Subleading Corrections in the biased DDLA problem: 
Using traveling front approach, we managed to calculate 
only the leading behavior of the average density in the 
biased DDLA model (see Section IV). It would be in- 
teresting to compute the subleading corrections to this 
leading behavior. 

DDLA model with stochastic screening: In this paper, 
we have studied a DDLA model where the screening is de- 
terministic in the sense that a particle, on its way down- 
wards, stops definitely when it reaches at a site such that 
at least one of the daughter nodes of that site is occupied. 
It would be interesting to consider a stochastic screening 
version defined as follows. For simplicity, we define the 
model for b = 1 case, though it can be trivially gener- 
alized to b > 1 case also. The particles are introduced 
sequentially from the top as before and a new particle 
is introduced only when the previous particle has com- 
pletely stopped moving. On its way down, at each site i a 
particle performs the following steps: if both the daugh- 
ter nodes of i are already occupied, the particle rests at i 
for all subsequent times and then a new particle is added. 
If both the daughter nodes are empty, then the particle 
chooses one of the daughter nodes at random and moves 
there. If, however, only one of the daughter nodes (say 
the left one) is occupied but the other one (the right one) 
is empty, then with probability p (0 < p < 1) the particle 
moves to the right daughter node and with probability 
(1 — p) it rests at i for all subsequent times. Clearly 
the case p = corresponds to the deterministic screen- 
ing model studied in Section II. On the other hand, for 
p = 1, it is obvious that at the saturation the tree will 
be completely full and the density will be exactly 1. It 
would be interesting to compute the statistics of density 
in this stochastic screening model. 

Disordered b-DDLA model: In the present paper, we 
considered the b-DDLA model when all the nodes of the 
tree have the same capacity b. In a disordered version of 
the problem, this node capacity 6, of a site i may vary 
from one site to another. One can consider bi's to be a 
set of quenched variables (as in the usual models in dis- 
orderd systems) , each drawn independently from a spec- 
ified distribution p(b). Then, for a given fixed set of bi's, 
one would first like to compute, for example, the average 
number of particles (m) at saturation, and then average 
this quantity over the disorder to obtain (n;), where — 
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denotes the average over the bi's. It would also be inter- 
esting to compute the sample to sample fluctuations of 
the average density (n;). 
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APPENDIX: DERIVATION OF THE 
GENERATING FUNCTION 



where we have used the identity J °° e z z n dz — n! and 
the definition of Fi(s) in Eq. (5). Next we take the 
Laplace transform on both sides of Eq. (A4). Using the 
initial conditions for n = 0, one can show easily that 
d k Ui{z)/dz k \ z=0 = for all Z > 1 and k < (6 - 1). Using 
this condition and doing integration by parts, one finds 

£(f) S fe ^ +1 ( S )=4cM2 S ). (A6) 

k=0 ^ ' 

Summing the left hand side of Eq. (A6) and identify- 
ing Ui(s) = Fi(s) as in Eq. (A5) then gives the desired 
recursion relation 

^ +l(s) = 7IT^ (2s) - (A7) 



In this appendix, we present the derivation of Eq. (5) 
where Fi(s) is defined in Eq. (4). Our starting point is 
the linear equation (3). We first define the exponential 
generating function, 



n=0 



(Al) 



Multiplying both sides of Eq. (3) by z n /n\ and summing 
over n, it is easy to see that Hi(z) satisfies the 6-th order 
nonlocal differential equation, 



d b H l+1 {z) 
dz b 



= 2H l {z/2)e z / 2 , 



(A2) 



for all I > 1. This recursion in Eq. (A2) starts from 
the initial function H\{z) which needs to be computed 
separately. Using F\{n) = for < n < (6 — 1) and 
Fi(n) = 1 for n > 6, we find H^z) = ^=0 z k /k\. The 
next step is to define a new function, 



U l {z) = H l {z)e- z = y^ Fl {n)—e- z . 

TV. 



n=0 



(A3) 



From Eq. (Al), it follows, after a few steps of algebra, 
that Ui(z) satisfies the differential equation, 

fe=0 v 7 

for all I > 1 starting with the initial function, U\{z) = 
e- z E b k -J z k /k\. 

We now define the Laplace transform, Ui(s) = 
J °° Ui(z)e~ sz dz. Taking the Laplace transform in Eq. 
(A3), we get 



00 



n=0 



(A5) 
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